Probabilistic Tree Transducers for Grammatical Error Correction

نویسنده

  • Jan Moolman Buys
چکیده

We investigate the application of weighted tree transducers to correcting grammatical errors in natural language. Weighted finite-state transducers (FST) have been used successfully in a wide range of natural language processing (NLP) tasks, even though the expressiveness of the linguistic transformations they perform is limited. Recently, there has been an increase in the use of weighted tree transducers and related formalisms that can express syntax-based natural language transformations in a probabilistic setting. The NLP task that we investigate is the automatic correction of grammar errors made by English language learners. In contrast to spelling correction, which can be performed with a very high accuracy, the performance of grammar correction systems is still low for most error types. Commercial grammar correction systems mostly use rule-based methods. The most common approach in recent grammatical error correction research is to use statistical classifiers that make local decisions about the occurrence of specific error types. The approach that we investigate is related to a number of other approaches inspired by statistical machine translation (SMT) or based on language modelling. Corpora of language learner writing annotated with error corrections are used as training data. Our baseline model is a noisy-channel FST model consisting of an n-gram language model and a FST error model, which performs word insertion, deletion and replacement operations. The tree transducer model we use to perform error correction is a weighted top-down tree-to-string transducer, formulated to perform transformations between parse trees of correct sentences and incorrect sentences. Using an algorithm developed for syntax-based SMT, transducer rules are extracted from training data of which the correct version of sentences have been parsed. Rule weights are also estimated from the training data. Hypothesis sentences generated by the tree transducer are reranked using an n-gram language model. We perform experiments to evaluate the performance of different configurations of the proposed models. In our implementation an existing tree transducer toolkit is used. To make decoding time feasible sentences are split into clauses and heuristic pruning is performed during decoding. We consider different modelling choices in the construction of transducer rules. The evaluation of our models is based on precision and recall. Experiments are performed to correct various error types on two learner corpora. The results show that our system is competitive with existing approaches on several error types.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

The Impact of Immediate Grammatical Error Correction on Senior English Majors’ Accuracy at Hebron University

This study aimed at investigating the effects of grammatical error correction on EFL learners’ accuracy. Twenty-two male and female senior students were chosen randomly to respond to a questionnaire investigating their beliefs about immediate grammatical error correction.  Actually, the study was conducted in order to answer this question: what is the effect of grammatical error feedback on stu...

متن کامل

The Impact of Immediate Grammatical Error Correction on Senior English Majors’ Accuracy at Hebron University

This study aimed at investigating the effects of grammatical error correction on EFL learners’ accuracy. Twenty-two male and female senior students were chosen randomly to respond to a questionnaire investigating their beliefs about immediate grammatical error correction.  Actually, the study was conducted in order to answer this question: what is the effect of grammatical error feedback on stu...

متن کامل

Grammatical Error Correction of English as Foreign Language Learners

This study aimed to discover the insight of error correction by implementing two correction systems on three Iranian university students. The three students were invited to write four in-class essays throughout the semester, in which their verb errors and individual-selected errors were corrected using the Code Correction System and the Individual Correction System. At the end of the study, the...

متن کامل

The Effect of Focused Corrective Feedback and Attitude on Grammatical Accuracy: A Study of Iranian EFL Learners

Abstract The study aimed at investigating the efficacy of written corrective feedback (CF) in improving Iranian EFL learners’ grammatical accuracy. It compared the effects of focused and unfocused written CF on the learners’ grammatical accuracy. 75 EFL students formed a one control and two experimental groups. The focused feedback group was provided with error correction in tenses. The unfocus...

متن کامل

The Effect of Focused Corrective Feedback and Attitude on Grammatical Accuracy: A Study of Iranian EFL Learners

Abstract The study aimed at investigating the efficacy of written corrective feedback (CF) in improving Iranian EFL learners’ grammatical accuracy. It compared the effects of focused and unfocused written CF on the learners’ grammatical accuracy. 75 EFL students formed a one control and two experimental groups. The focused feedback group was provided with error correction in tenses. The unfocus...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2013